Replacing uncertainty decoding with subband re-estimation for large vocabulary speech recognition in noise

نویسندگان

  • Jianhua Lu
  • Ji Ming
  • Roger F. Woods
چکیده

In this paper, we propose a novel approach for parameterized model compensation for large-vocabulary speech recognition in noisy environments. The new compensation algorithm, termed CMLLR-SUBREST, combines the model-based uncertainty decoding (UD) with subspace distribution clustering hidden Markov modeling (SDCHMM), so that the UD-type compensation can be realized by re-estimating the models based on small amount of adaptation data. This avoids the estimation of the covariance biases, which is required in model-based UD and usually needs a numerical approach. The Aurora 4 corpus is used in the experiments. We have achieved 16.9% relativeWER (word error rate) reduction over our previous missing-feature (MF) based decoding and 16.1% over the combination of Constrained MLLR compensation and MF decoding. The number of model parameters is reduced by two orders of magnitude.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Combining noise compensation and missing-feature decoding for large vocabulary speech recognition in noise

In this paper we propose a combination of noise compensation and missing-feature decoding for large-vocabulary speech recognition in noisy environments. Two approaches for noise compensation have been studied. These are noise training and vector Taylor series expansion, aiming to compensate white Gaussian noise at various levels. This is followed by subband missing-feature decoding to reduce th...

متن کامل

Joint Uncertainty Decoding for Robust Large Vocabulary Speech Recognition

Standard techniques to increase automatic speech recognition noise robustness typically assume recognition models are clean trained. This “clean” training data may in fact not be clean at all, but may contain channel variations, varying noise conditions, as well as different speakers. Hence rather than considering noise robustness techniques as compensating clean acoustic models for environment...

متن کامل

Joint Uncertainty Decoding for Noise R

Background noise can have a significant impact on the performance of speech recognition systems. A range of fast featurespace and model-based schemes have been investigated to increase robustness. Model-based approaches typically achieve lower error rates, but at an increased computational load compared to feature-based approaches. Thismakes their use inmany situations impractical. The uncertai...

متن کامل

On the Estimation and Use of Feature Reliability Information for Noise Robust Speech Recognition

In this paper we present an Uncertainty Decoding rule which exploits feature reliability information and interframe correlation for noise robust speech recognition. The reliability information can be obtained either from conditional Bayesian estimation, where speech and noise feature vectors are tracked jointly, or by augmenting conventional point estimation methods with heuristics about the es...

متن کامل

Uncertainty Decoding for Noise Robust Automatic Speech Recognition

This report presents uncertainty decoding as a method for robust automatic speech recognition for the Noise Robust Automatic Speech Recognition project funded by Toshiba Research Europe Limited. The effects of noise on speech recognition are reviewed and a general framework for noise robust speech recognition introduced. Common and related noise robustness techniques are described in the contex...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2009